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Abstract. By considering a least squares approximation of a given square 
integrable function /: [0, 1]" — > K by a shifted L-statistic function (a shifted 
linear combination of order statistics) , we define an index which measures the 
global influence of the kth largest variable on /. We show that this influence 
index has appealing properties and we interpret it as an average value of the 
difference quotient of / in the direction of the fcth largest variable or, under 
certain natural conditions on /, as an average value of the derivative of / in 
the direction of the kth largest variable. We also discuss a few applications of 
this index in statistics and aggregation theory. 



1. Introduction 

Consider a real- valued function f oi n variables xi, . . . ,Xn and suppose we want 
to measure a global influence degree of every variable xt on f . A reasonable way 
to define such an influence degree consists in considering the coefficient of Xi in the 
best least squares approximation of / by affine functions of the form 

n 



g{xi, ...,Xn)^CQ+y ^CjX 



This approach was considered in [SlII^ for pseudo-Boolean functions^ /: {0, 1}" — > 
M and in [9] for square integrable functions /: [0, 1]" — s- R. It turns out that, in 
both cases, the influence index of Xi on / is given by an average "derivative" of / 
with respect to Xi. 

Now, it is also natural to consider and measure a global influence degree of 
the smallest variable, or the largest variable, or even the kth largest variable for 
some fc S {1, . . . ,n}. As an application, suppose we are to choose an appropriate 
aggregation function /: [0, 1]" ~> M to compute an average value of [0, l]-valued 
grades obtained by a student. If, for instance, we use the arithmetic mean function, 
we might expect that both the smallest and the largest variables are equally influent. 
However, if we use the geometric mean function, for which the value (the left 
endpoint of the scale) is multiplicatively absorbent, we might anticipate that the 
smallest variable is more influent than the largest one. 
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Similarly to the previous problem, to define the influence of the fcth largest 
variable on / it is natural to consider the coefficient of X(^k) in the best least squares 
approximation of / by symmetric functions of the form 

n 

g{xi,...,x„) = ao + y^^aiXi^i), 

i=l 

where X(^i), . . . are the order statistics obtained by rearranging the variables 
in ascending order of magnitude. 

In this paper we solve this problem for square integrable functions / : [0, 1]" — > M. 
More precisely, we completely describe the least squares approximation problem 
above and derive an explicit expression for the corresponding influence index (§2). 
We also show that this index has several natural properties, such as linearity and 
continuity, and we give an interpretation of it as an average value of the difference 
quotient of / in the direction of the fcth largest variable. Under certain natural 
conditions on /, we also interpret the index as an average value of the derivative of 
/ in the direction of the fcth largest variable (§3). We then provide some alternative 
formulas for the index to possibly simplify its computation (§4) and we consider 
some examples including the case when / is the Lovasz extension of a pseudo- 
Boolean function (§5). Finally, we discuss a few applications of the index (§6). 

We employ the following notation throughout the paper. Let I" denote the n- 
dimensional unit cube [0, 1]". We denote by L^(I") the class of square integrable 
functions /: I" — ?> M modulo equality almost everywhere. For any S C [n] = 
{l,...,n}, we denote by I5 the characteristic vector of S in {0, 1}" (with the 
particular case — I0). 

Recall that if the I- valued variables xi, . . . , a;„ are rearranged in ascending order 
of magnitude X(^ij ^ • • • ^ 2^(n); then is called the kth order statistic and the 
function osfc : I" — > R, defined as osfc(x) = a:(fe), is the kth order statistic function. 
As a matter of convenience, we also formally define osq = and os„-|-i = 1. To 
stress on the arity of the function, we can replace the symbols X(^k) and os^ with 
Xk:n and 0Sfe:„, respectively. For general background on order statistics, see for 
instance [Hll]. 

Finally, we use the lattice notation A and V to denote the minimum and maxi- 
mum functions, respectively. 

2. Influence index for the fcxH largest variable 

An L-statistic function is a linear combination of the functions osi, . . . ,os„. A 
shifted L-statistic function is a constant plus an L-statistic function. Denote by 
Vl the set of shifted L-statistic functions. Clearly, Vl is spanned by the linearly 
independent set 

(1) B = {osi, . . . ,OS„,OS„+i} 

and thus is a linear subspace of L^(I") of dimension n + 1. For a given function 
/ G L^(I"), we define the best shifted L-statistic approximation of f as the function 
/l G Vl that minimizes the distance 

||/-gf = / (/(x)-.9(x))'dx 
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among all g G Vl, where || • || is the norm in associated with the inner product 

{fi9) — /in f{^)9{^)d'^- Using the general theory of Hilbert spaces, we immedi- 
ately see that the solution of this approximation problem exists and is uniquely 
determined by the orthogonal projection of / onto Vl ■ This projection is given by 

n+l 

(2) fL = Yl ' 

where the coefficients aj (for £ [n + 1]) are characterized by the conditions 

(3) (/ - /l , OS,:) = for all ie[n + l]. 

Consider the matrix representing the inner product in the basis ([T]), that is, the 
square matrix AI of order n+l defined by (M)ij = (oSi,oSj} for all i,j G [n + 1]. 
Denote also by b the {n + 1) x 1 column matrix defined by (b)i = (/, os^) for all 
i e [n + l] and by a the (n + l) x 1 column matrix defined by (a)j = Oj for all 
j £ [n + 1]. Using this notation, the unique solution of the approximation problem 
defined in ^ and ([3]) is simply given by 

(4) a A/"^b. 

To give an explicit expression of this solution, we shall make use of the following 
formula (see [3]). For any integers 1 ^ fci < • • • < fc„j ^ n and any nonnegative 
integers ci, . . . , Cm, we have 

^ ^ 1 1\ ^" + r c,) ! (A, - 1 + ElZl c.) ! ■ 

Lemma 1. For every i,j e [n + l], we have 

min(i, j)(max(i, j) + l) 



(6) (M)., 
and 



(n + l)(n + 2) 



(7) 



ifi^j<n + l, 
(M-'h _ J if J = n + l, 



(n + l)(n + 2) ]-l, tf\i-j\^l, 
, , otherwise. 



Proof. The formula for {M)ij — (oSi,0Sj) immediately follows from ([5|). The for- 
mula for {M^^)ij can be checked easily. □ 

Recall that the central second difference operator is defined for any real sequence 
(^fc)fc^i as S'^Zk = Zk+i — 2zfe + Zk-i. For every k G [n], define the function 
gk e L\r) as 

(8) 5fe^-(n+l)(n + 2)(5^0Sfc. 

Using (III) and (O, we immediately obtain the following explicit forms for the com- 
ponents of /l in the basis ([T]). 

Proposition 2. The best shifted L-statistic approximation f^ of a function f G 
L^(I") is given by 0), where 

if, 9k) , ifke [n], 

(n + 1)2(/, 1) - (n + l)(n + 2)(/, os„) , ifk = n + l. 



(9) ak 
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Now, to measure the global influence of the fcth largest variable X(^k) on an 
arbitrary function / G we naturally define an index /: x [n] — > M as 

/(/, fc) = flfe, where Ofc is obtained from / by ©• We will see in the next section 
that this index indeed measures an influence degree. 

Definition 3. Let /: L'^{r) x [n] R be defined as /(/, k) = {f,gk), that is 

(10) /(/, k) = -in + l)(n + 2) / /(x) x^k) dx. 

Remark 1. By combining ([5]) and ((TO)) . we see that the index I{f,k) can be easily 
computed when / is any polynomial function of order statistics. 

Thus we have defined an influence index from an elementary approximation 
(projection) problem. Conversely, the following result shows that the best shifted 
i-statistic approximation of / e L^(I") is the unique function of Vl that preserves 
the average value and the influence index. To this extent, we observe that letting 
i = n + 1 in ^ leads to (Jl, 1) — (/, 1), that is, 

n+l 

(11) -—J2kak = {f,l). 

Proposition 4. A function g Q Vl is the best shifted L-statistic approximation 
of f G L^(I") if and only if /j„ /(x) dx — /j„ g{x) dx and I{f, k) = I{g, k) for all 
A: e [n]. 

Proof. We formally extend /(/, ■) to [n + 1] by defining /(/, n + 1) = On+i. By ([3]), 
the function g S Vl is the best shifted L-statistic approximation of / e L^(I") if 
and only if (/, osi) = (5, os;) for all i S [n + 1]. By ([4]), this condition is equivalent 
to /(/, k) = I{g, k) for aU fc £ [n + 1]. We then conclude by (HTJ)- □ 

Remark 2. Combining ([2|) with (fTTj) . we can rewrite the best shifted L-statistic 
approximation of / as /l = (/, 1) + Y2=i ^if' k){x^k) - 

3. Properties and interpretations 

In this section we present various properties and interpretations of the index 
/(/, k). The first result follows immediately from Definition |31 

Proposition 5. For every k S [n], the mapping f H> I(f,k) is linear and continu- 
ous. 

We now present an interpretation of /(/, k) as a covariance. Considering the 
unit cube I" as a probability space with respect to the Lebesgue measure, we see 
that, for any k G [n], the index I{f,k) is the covariance of the random variables 
/ and gk. Indeed, we have /(/, fc) = E{f gk) = cov{f,gk) + E{f)E{gk), where 
E{gk) = {i,gk) — ~ 0. From the usual interpretation of the concept of 

covariance, we see that /(/, k) is positive whenever the values of / — E{f) and 
gk — E{gk) = gk have the same sign. Note that (?fe(x) is positive whenever X(^k) is 
greater than ^{x(^k+i) + ^{k-i)): which is the midpoint of the range of x^k) when 
the other order statistics are fixed at x. 

We now provide an interpretation of /(/, k) as an expected value of the derivative 
of / in the direction of the fcth largest variable (see Proposition [7]). 
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Let Sn denote the symmetric group on [n]. Recall that the unit cube I" can be 
partitioned almost everywhere into the open standard simplexes 

i;' = {x G r : < • • • < x^(„)} (tt e Sn). 

Definition 6. Given fc G [n], let /: UneSn IJJ — s> K be a function such that the 
partial derivative -D7r(fe)/|ij exists for every it G Sn- The derivative of f in the 
direction (k) is the function D^k)f: U7rGS„ I" R defined as 

I?(fc)/(x)=I?,(,)/(x) foraU x G 

Remark 3. By considering the chain rule in U^gs^I^ with the usual assumptions, 
we immediately obtain the formula 

n 

D(k)f{gi{x), ■ ■ • ,ffn(x)) = ^(A/)(gi(x), . . . ,5n(x)) D(k)gt{yi)- 

i=l 

Now, for every fc G [n], consider the function hk G defined as 

hk = {n + l)(n + 2)(oSfe+i - oSfc)(oSfc - 0Sfc_i). 

It is immediate to see that is nonnegative and continuous and that hk ~ —gk, 
where gk is defined in ([5]). Moreover, using ([S]) or ©, we easily see that hk is a 
probability density function on I" . This fact can also be derived by choosing / = osfc 
in the following result. 

Proposition 7. For every k G [n] and every f G L^(I") such that D(^k)f contin- 
uous and integrable on U^gs^I^, we have 

(12) I{f,k) = f /ifc(x)D(fc)/(x)dx. 

Proof. Fix k G [n]. Using the product rule, we obtain 

hki^) i?(fc)/(x) -i^(fc)(/ifc(x)/(x)) +gfc(x)/(x), 
and hence we only need to show that 

(13) / i?(fc)(/ifc(x)/(x))dx = 0. 



But the left-hand side of (1131) can be rewritten as 



E / i?.(fc)(/ifc(x)/(x))dx 

2^ ■■■ / ■D7r(fc)(^fc(x)/(x))da;^(i)dx^(2)---dx^(„), 

"'0 JQ Jo Jo 



that is, if we permute the integrals so that we integrate first with respect to a;7r(fc) , 
X! / ■■■ / ^7r(fc)(/ife(x)/(x)) dx^(fe) •■•da;„(i) 

which is zero since so is the inner integral. □ 

Remark 4. Under the assumptions of Proposition [71 if D(^k)f = (resp. ^ 0, ^ 0) 
almost everywhere, then /(/, fc) = (resp. ^0, < 0). 



6 



JEAN-LUC MARICHAL AND PIERRE MATHONET 



We now give an alternative interpretation of /(/, k) as an expected value, whicli 
does not require the additional assumptions of Proposition [T] In this more general 
framework, we naturally replace the derivative with a difference quotient. To this 
extent, we introduce some further notation. As usual, we denote by the zth 
vector of the standard basis for W\ For every k G [n] and every h G [0,1], we 
define the {k)- difference (or discrete (k)- derivative) operator ^{k),h over the set of 
real functions on I" by 

A(fc)./i/(x) = /(x + /ie^(fe))-/(x) 

for every x € I" such that x + /ie^(j,-) G I^. Thus defined, the value A(j,-)j,/(x) can 
be interpreted as the marginal contribution of x^j,) on / at x with respect to the 
increase h. For instance, we have ^(k),h ^(k) = h. 

Similarly, we define the (k)- difference quotient operator Q[k),h over the set of 
real functions on I" by Q(k).hf{^) = i^{k).hfi^)- 

Theorem 8. For every k G [n] and every f G i^(I"), we have 

(14) /(/, k)^{n + l){n + 2) f f A(fe),,_,^^, /(x) dydx. 

Proof. The right-hand side of ([Ti)) can be rewritten as 

(15) (n + l)(n + 2) ^ / / (/(x+(y-.T 

On the one hand, we have 

(16) / / f{x)dydx^ (a;^(fc+i) - x^(fc))/(x) dx. 

On the other hand, by permuting the integrals exactly as in the proof of Proposi- 
tion [3 we obtain 

x-^{k + l} 

/(x + (y - x^^k) ) e^(fc) ) dy dx 

(k) 

/ f{^+{y~ Xyr{k) ) e^(fc) ) dy dx^{k) ■ ■ ■ dx^(i) 

which, by permuting the two inner integrals, becomes 



IZ Jx 







x„ik+i) rv 

/ /(x +{y- x^i^k)) e^{k)) dx^(^k) dy--- da;^(i) 

„(k-l) •'x^^k-l) 

x^{k + l) 



1 



{y - x„(k-i) )/(x + (y — e^(^k)) dy - - - dx^^^^iy 



r(fc-l) 

By renaming y as XT^(^k) i we finally obtain 

r rx^ik+i) r 
(17) / / f{x+{y-x^(k))eT,{k))dydx^l (x^(fc) - a;„(fe_i))/(x) dx. 

By substituting (HH) and ([TT]) in ([TS]), we finally obtain /(/, k). □ 

As an immediate consequence of Theorem[8l we have the following interpretation 
of the index /(/, k) as an expected value of a difference quotient with respect to 
some distribution. 
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Corollary 9. For every k G [n] and every f E we have 

Hf^k)^ / pk{y.,y)Q(k),y~Xf^-^f{y-)dydy., 

where pfe(x, j/) = (71 + \){n + 2)(y — defines a probability density function on 

the set {(x,2/) : x e I", y e [x(^k),X(^k+i)\}- 

Another important feature of the index is its invariance under the action of 
permutations. RecaU that a permutation it Sn acts on a function / : I" — > M 
by T:{f){xi, . . . , Xn) — f{x.,r(i), . . . , By the change of variables theorem, we 

immediately see that every tt e S'„ is an isometry of L^(I"), that is, {TT{f),n{g)) = 
{f,g)- From this fact, we derive the following result. 

Proposition 10. For every f G L^(I") and every n £ 5„, both functions f and 
Tr{f) have the same best shifted L-statistic approximation f^. Moreover, we have 

Mf) - fd ^ \\f - fd- 

Proof. Let / G g G Vl, and tt G Sn- Since n is an isometry of 

and g is symmetric, by (jSj we have (7r(/),g) — {f,g) — {fL,g), which shows that 

'^{f)L = /l- Using similar arguments, we obtain 

h{f)-fL\? = {n{f)-fL,n{f)-fL) = {f-fLj~fL) 

= Wf-hf, 

which completes the proof. □ 
With any function / : I" — > M we can associate the following symmetric function 

Sym(/) = ^J2 

It follows immediately from PropositionslSlandlTOlthat both functions / and Sym(/) 
have the same best shifted L-statistic approximation f^. Combining this observa- 
tion with Proposition 1101 we derive immediately the following corollary. 

Corollary 11. For every fc G [n], every f G and every tt G Sn, we have 

I{f,k) = I{^{f),k) = I{SYui{f),k). 

Remark 5. Corollary [TT] shows that, to compute I(f,k), we can replace / with 
Sym(/). For instance, if /(x) = Xi for some i G [n] then Sym(/) = i X)r=i ^« = 
i X^iLi ^{i} ^^'^ hence, using Proposition [71 we obtain /(/, fc) = i. 

We say that two functions / : I" — M and g : I" — R are symmetrically equivalent 
(and we write f ^ g) if Sym(/) = Sym((7). By Corollary [TTl for any f,g G 
such that / ~ g, we have /(/, k) = I{g, k). 

We end this section by analyzing the behavior of the influence index /(/, k) on 
some special classes of functions. 

Given A; G [n], we say that the order statistic ^(j.) is ineffective almost everywhere 
for a function /: I" — ^ M if A(j.) ^.^^^^^ /(x) = for almost all x G U7reS„I?J and 
almost all y G [. For instance, given unary functions /i,/2 G ^^(I), 

the order statistic is ineffective almost everywhere for the function / : ^> R 
such that 

r, ^ ] fl{xi), if Xl > X2, 

f{xi,X2)=< 

J2[X2), if Xl < X2- 
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The following result immediately follows from Theorem |8l 

Proposition 12. Let k G [n] and f G If X(k) is ineffective almost every- 

where for f , then I{f, k) = 0. 

The dual of a function / : I" ^ M is the function : I" ^ M defined by /'^(x) = 
1 — /(l[n] — x). A function /: I" — !■ R is said to be self-dual if = f. By using 
the change of variables theorem, we immediately derive the following result. 

Proposition 13. For every f e and every k £ [n], we have I{f'^,k) = 

/(/, n — k + 1). In particular, if f is self-dual, then I{f, k) =^ I{f, n — k + I). 



4. Alternative expressions for the index 

The computation of the index /(/, k) by means of PU)) or might be not 
very convenient due to the presence of the order statistic functions. To make those 
integrals either more tractable or easier to evaluate numerically, we provide in this 
section some alternative expressions for the index /(/, k) that do not involve any 
order statistic. 

We first derive useful formulas for the computation of the integral (/, osk) (Propo- 
sition [IT]). To this extent, we consider the following direct generalization of order 
statistic functions. 

Definition 14. For every nonempty S = {ii, . . . ,is} C [n], s = \S\, and every 
/c G [s], we define the function 0Sfe:5 : I" ^» R as 0Sfe:5(x) = oSk:s{xii , . . . ,XiJ. 

To simplify the notation, we will write Xk-.s for osj..5(x). Thus x^-.s is the kth 
order statistic of the variables in S. 

Lemma 15. For every s G [n] and every fc G [s], we have 

E--i:(r-;)(::^ 

SC[n] j=k ^ ^ ^ 

\S\=s 

Proof. Since both sides of ([T5)) are symmetric and continuous functions on I", we 
can assume Xi < • • • < x„. Then, for every j G [n], we have x^-s = xj if and only 
ii S B j and jS* fl [j — 1]| = k — 1. The result then follows by counting those sets S 
of cardinality s and having these two properties. □ 

Lemma 16. For every fc G [n], we have 

\S\ - 1 

X\S\:S 



(19) Xk:n = E 



SC[n] 

\S\^k 



(20) x,..^ = (-l)'"-"+'-'('^L/)-i^^ 

SC[n] ^ " ^ 
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Proof. By using (fTSi) . we can rewrite the right-hand side of ([T9l) as 

s=fc ^ ^ SC[n] s=k ^ ^ 3=s ^ 

\S\=s 

j=k ^ ' s=k ^ 

where the inner sum equals (1 - I)-''"''. This proves (HH). Identity ^ then follows 
by dualization. □ 

We now provide four formulas for the computation of the integral Jj„ /(x) X(^k) dx. 
From these formulas we will easily derive alternative expressions for the index 
Iif,k). 

Proposition 17. For every function f G L^(I") and every k G [n], the integral 
Jk:n — Jjn /(x) dx is given by each of the following expressions: 

(21) /,„ /(x) dx - E5CN:|5|> /o /[o,,]- /[o,i]H\s /(x) dxdy 

(22) E5CN:|5|>.-fe+i(-l)l"l-"+'-^('f^.') /o 4,1]- /[o,i]H\s /(x) dxdy 

(23) /j„ /(x) dx - Escin]:\s\^k la I[o,y]^ I[y,i]M\s /(x) ^x dy 

(24) E5 C[n]:|S|<fe /o /[o,y]s4,i]H\s/(x)dxd?; 

Proof. By linearity of the integrals, we can assume that / has nonnegative values. 
Then, we define the measure /i/ as ^J,f{A) = /(x) c?x for every Borel subset A of 
I". To compute integral Jk-.n, we can use Lemma [TBI and compute only the integrals 
■^151:5 = Jin /(x) Vigs^^i c^x and Ji:5 = Jj„ /(x) Aies^i dx. To this extent, wc define 

F|s|:5(y) = M/({x e r : V,es2:. ^ 2/}) = / / /(x)dx 

J[0,y]^ J[0,1]H\S 

and 

Fi:s{y) - M/({xer :A,e52:. s^y}) - M/(I")-M/({xer :A,esx, >y}) 



/ /(x)dx- / / /(x)dx 

Jl" J[v,l]S J[0,1]M\S 



We then have 

J\S\:S ^ / yiesxidfif^ / ?;di^|s|:s(y) = lim ■F'|s|:s(y) - / F\s\:siy)dy 



and similarly for Ji;S- This proves pTt and ((22)) . 
We prove similarly by considering 

Fk:n{y) = M/({xer sCy}) 

= M/(U|s|^/c{x e I" : Vig5Xi < y < Aie[n]\sXt}) 



/ /(x)rfxd2;. 
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Finally, we prove ([24|) by observing that 

/ /(x)dx = = M/(U|s|^o{x e I" : V.gsx, < 2/ < A,;g[„]\sa;J) 

= E / / / /(x)dxd2/. □ 

Remark 6. In the special case when / is a probability density function on I", 
the integral Jj„ /(x) xi^k) dx is precisely the expected value Ef{Xk:n), which is well 
investigated in statistics (see [Hll])- 

From Definition [3] and Proposition 1171 we derive the following three formulas. 
The computations are straightforward and thus omitted. 

(25(]^+i)(„+2) ^ I^sc[n]:|S|>fc-i(^l)''^'^"^ '°(''^L^^)/o /[o,a]s /[o,i]H\s /(x) rfxdy 

(?rP4lKn+2) ^ SsC[n]:|S|5;«-fc(~l)''^' ^ (j^'fc+l) /o /[0,l]["l\s /W dx 

(2%Tiy(M^ ^ (I]sC[n]:|S|=/c-l ^I]sC[n]:|S| = fe) /o I[0,y]S Ily,l]M\s f i^) dy . 

5. Some examples 

We now apply our results to two special classes of functions, namely the mul- 
tiplicative functions and the Lovasz extensions of pseudo-Boolean functions. The 
latter class includes the so-called discrete Choquet integrals, well-known in aggre- 
gation function theory. 

5.1. Multiplicative functions. Consider the function /(x) = Y['i=i where 
i^i e and set $i(a;) — fi{t) for i = 1, . . . , n. By using we obtain 

^ SC[ri] ^ ^ ie[n]\S " »es 

|S|^fc-l 

The following result gives a concise expression for /(/, k) when / is symmetric. 

Proposition 18. Let /: I" — ;> R be given by /(x) = Y[i=i vi^i)' where ip G i^(It), 
and let — (p{t) dt. Then, for every k G [n], we have 

'$(1)" DMz; fc + 1, n - fc -f 2)U^$(^)/$(i) dy, ij $(1) ^ 0, 

_ (-l)«-'=+i (n + 1) p(,j;rg;c+2) /o ^2/' ^/ = 0' 

where h(z;a,b) = z"^^(l — zf'^^ / B{a^h) is the probability density function of the 
beta distribution with parameters a and b. 

Proof. Suppose that $(1) ^ 0. By using (|27|) . we obtain 



(n+ l)(n + 2) 



E - E )/ <i>(2/)i^i(*(i)-*(y))""'^'rfy 



SC[n] SC[n 
|5| = fc-l |S|=fe 



= $(1)" ( (^^, ^ (1 - - Q z'=(l - z)"-'^) 

which proves the result. The case $(1) = follows from ([^5)). □ 



dy, 

z='S>{y)/<f{l) 
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Example 19. Let /: I" ^ M be given by /(x) ^ {iTLi^iT^ '^'^ere c > -i. 
For instance, the product function corresponds to c = 1 and the geometric mean 
function to c = 1/n. We can calculate /(/, k) by using Proposition [TH] with (p{x) = 
x'^. Using the substitution z = y'^^^ and then integrating by parts, we obtain 

1 ^n+2 r(n + 3)r(fc-i + ^) _ nk-i+^ 
'"^^ - "\c + i) r(fc + i)r(n + i + ^) r(fc + i)r(^)'^^''^' 

with 



We observe that /(/, fc) — > /(/, 1) as c — !■ — ^. Also, for c > 0, we have /(/, fc + 1) < 
/(/, fc) for every fc G [n — 1]. As expected in this case, the smallest variables are 
more influent on / than the largest ones. 

5.2. Lovasz extensions. Recall that an n-place {lattice) term function p: I" — > I 
is a combination of projections x i— >■ (z S [n]) using the fundamental lattice 
operations A and V; see [2- For instance, 

p{xi,X2,xz) = {xi A 2:2) \J Xz 

is a 3-place term function. Note that, since I is a bounded chain, here the lattice 
operations A and V reduce to the minimum and maximum functions, respectively. 
Clearly, any shifted linear combination of n-place term functions 

m 

/(x) = co + ^CiPj(x) 

1=1 

is a continuous function whose restriction to any standard simplex I" (tt G Sn) is a 
shifted linear function. According to Singer [111 §2], / is then the Lovasz extension 
of the pseudo-Boolean function /|{o,i}", that is, the continuous function /: I" — > M 
which is defined on each standard simplex as the unique affine function that 
coincides with /|{o,i}" at the n + 1 vertices of I". Singer showed that a Lovasz 
extension can always be written as 

n 

(29) /(x) = /„\i + 5](/f -/r+i)a;^(.) (x £ I^J), 

1=1 

with /f /(l{7r(i),...,7r(n)}) = ^ f {{tt {%),..., -R {n)}) for i e [n+ 1], where the set 
function Vf : 2["1 — > M is defined as Vf{S) — /(Is). In particular, /^^^ — c^ — /(O). 
Conversely, any continuous function /: I" — > M that reduces to an affine function 
on each standard simplex is a shifted linear combination of term functions: 

(30) /(x)= ^ m;(5)xi:5, 

SC[n] 

where to/ : 2l"l — R is the Mobius transform of Vf, defined as 

TCS 



Indeed, expression (j30p reduces to an affine function on each standard simplex 
and agrees with /(Is) at Is for every S C [n]. Thus the class of shifted linear 
combinations of n-place term functions is precisely the class of n-place Lovasz 
extensions. 
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Remark 7. A nondecreasing Lovasz extension /: I" — > M such that /(O) = is also 
called a discrete Choquet integral. For general background, see for instance [5]. 

For every nonempty S C [n] and every k G [\SW^ the function 0Sfe:S is a Lovasz 
extension and, from (jl9l) . we have 



TC5 



The following proposition gives a concise expression for the index I{oSj.s, k). We 
first consider a lemma. 

Lemma 20. For every nonempty S C [n] and every j G [\S\], we have 



Sym(oSj.5) = J2 '^^o-T- 



^\S\J TC[«] 
\T\ = \S\ 

Proof. It is easy to see that Sym(oSj:s)(x) — J2Ties„ 0Sj:7r(s)(x). This proves the 
result for there are exactly \S\l{n — \S\y. permutations that map 5 to a given set 
T C [n] such that |T| = IS*]. □ 

Proposition 21. For every nonempty S C [n], every j G [\S\], and every k G [n], 
we have 

(31) -r(0Sj:S, fc) = 

\\S\) 

if ^ k — j ^ n — \S\, and 0, otherwise. 

Proof. The result follows from Corollary [11] Lemma [151 and Lemma I20H □ 

The following proposition gives an explicit expression for the index /(/, k) when 
/ is a Lovasz extension. 

Proposition 22. // / : I" — R is a Lovasz extension, then 

n 

(32) /(x) = /(0) + Ex(,)i?(,)/(x). 

i=l 

Moreover, for every G [n], we have 

n—k+l , ,^ 

(33) I{f,k)^Vf{n-k+l)~Vf{n~k)^ ^ (^Ilj^/(^)' 

where Vf{s) = C)"^ Esc[„]:|5|=s "/(-S") andrfif{s) = (")"^ Esc[„]:|S|=s™/('S')- 

Proof. Let / : I" ^ M be a Lovasz extension and let fc G [n]. For every tt G Sn and 
every x G from 1^ if follows that D^(fc)/(x) = - This estahshes 



^The right-hand side of JSTJ can also be viewed as the multivariate hypergeometric distribution 

k — l\ { n — k \ / ( -n, \ 
j-l)\\S\-j)/\\S\) 



(l) (^-l) i\'si- )/(\S\) obtained directly from the proof of Lemma [Tsl 
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By Proposition [71 we then obtain 

/(/,fc)= V / /lfc(x)I?,(fe)/(x)dx 



= {n + l){n + 2) ^ (/^ - / (x^(fc+i) - 2;^(fc))(x^(fe) - dx 

Since the integral is equal to l/(n + 2)!, we obtain /(/, k) — ^ Y^Tres^ifk ~ fk+i)^ 
which, after some algebra, leads to the first equality in Finally, by combining 
Proposition [5] with ([50)1 and (for j = 1), we completely establish (155)) . □ 



Remark 8. A function /: I" — > M solves equation (|32|) if and only if, for every 
TT G 5™, the function f\p^ — /(O) is an eigenfunction of the Euler operator with 
eigenvalue 1. Thus this function reduces to a homogeneous function of degree 1 
whenever it is differentiable. Notice however that such a function need not be linear 
even if / is continuous on I". For instance, the geometric mean /(x) = YYi=i ^1^" 
is a continuous function solving ([32)) . 



We can readily see that the shifted L-statistic functions are precisely the sym- 
metric Lovasz extensions. From this observation we derive the following result. 

Proposition 23. For any Lovasz extension /: I" — )■ K, we have Jl = Sym(/) and 

n 

Sym(/) = /(0) + ^/(/,z)os,. 

i=l 

Proof. Let / : I" — > M be a Lovasz extension. Then Sym(/) is a symmetric Lovasz 
extension or, equivalently, a shifted L-statistic function. By Propositions [5l and 
[To) we have — Sym(/)L — Sym(/). The result then follows since /l(0) — 
Sym(/)(0) = /(O). ^ □ 



6. Applications 

We briefly discuss some applications of the infiuence index in aggregation theory 
and statistics. We also introduce a normalized version of the index as well as the 
coefficient of determination of the approximation problem. 

6.1. Infiuence index in aggregation theory. Several indexes (such as interac- 
tion, tolerance, and dispersion indexes) have been proposed and investigated in 
aggregation theory to better understand the general behavior of aggregation func- 
tions with respect to their variables; see [5] Chap. 10]. These indexes enable one 
to classify the aggregation functions according to their behavioral properties. The 
index /(/, fc) can also be very informative and thus contribute to such a classifica- 
tion. As an example, we have computed this index for the arithmetic mean and 
geometric mean functions (see Remark [5] and Example I19p and we can observe for 
instance that the smallest variable a;(i) has a larger influence on the latter function. 

Remark 9. Noteworthy aggregation functions are the so-called conjunctive aggre- 
gation functions, that is, nondecreasing functions /: I" ^> M satisfying ^ /(x) ^ 
2^(1); see O Chap. 3]. Although these functions are bounded from above by 
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the index I(f,k) need not be maximum for k = 1. For instance, for the binary 
conjunctive aggregation function 



0, ifa:iVa;2<|, 
xi A X2 A J, otherwise. 



we have /(/, 1) = and /(/, 2) = if, and hence /(/, 1) < /(/, 2). 

In the framework of aggregation functions, it can be natural to consider and 
identify the functions / G for which the order statistics are equahy influent, 

that is, such that /(/, fc) ~ I{fA) for all k G [n]. As far as the Lovasz exten- 
sions are concerned, we have the following result, which can be easily derived from 
Proposition [21] and the immediate identities 



Proposition 24. // / : I" — > M is a Lovasz extension, then the following are equiv- 
alent. 

(a) We have I{f, k) = I{f, 1) for all k e [n]. 

(b) The sequence {vf{s))"^Q is in arithmetic progression. 

(c) We have rnf{s) = for s = 2, . . . ,n. 

6.2. Influence index in statistics. It can be informative to assess the influence 
of every order statistic on a given statistic to measure, e.g., its behavior with re- 
spect to the extreme values. From this information we can also approximate the 
given statistic by a shifted L-statistic. Of course, for L-statistics (such as Win- 
sorized means, trimmed means, linearly weighted means, quasi-ranges, Gini's mean 
difference; see ^ §6.3, §8.8, §9.4]), the computation of the influence indexes is 
immediate. However, for some other statistics such as the central moments, the 
indexes can be computed via (p5)) ~ (l27t . 



Example 25. The closest shifted L-statistic to the variance — - Y^"^i{Xi — X)"^ 
is given by 

-1 2 " 

with I{a^,k) = (n + 2)(2fc - n - l)/{n'^{n + 3)), which can be computed from 
(1271)11 We then immediately see that the smallest and largest variables are the 
most influent. 

6.3. Normalized index and coefficient of determination. Coming back to the 
interpretation of the influence index as a covariance (see §3), it is natural to consider 
the Pearson correlation coefficient instead of that covariance. In this respect, we 
note that a'^igk) = E{gf.) = I{gk, k) — 2(n + l){n + 2), where the latter inequality 
is immediate since gk ^Vl- 

Definition 26. The normalized influence index is the mapping 
r : {/ e L^(I") : / is non constant} x [n] R 



■^In terms of Gini's mean difference 4. §9.4], G = ^, ^^—iC^k — n — l)X^^.y we simply 



n(n~ 1) 

Obtain 4 = j^^^ (6(n + 2) G - {n + 1)) 
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defined by 

a{f) V2(n+l)(n + 2) 

From this definition it follows that — 1 ^ fifjk) ^ 1, where the bounds are 
tight. Moreover, this index remains unchanged under interval scale transformations, 
that is, r{af + b,k) — r{f, k) for all a > and 6 e M. Finally, we also have 
r{f'^,k) =r{f,n- k + 1). 

The coefficient of determination of the best shifted L-statistic approximation of 
a non constant function / G L^(I") is defined by B?{f) — cr^(/L)/o'^(/). We then 
have 

1 /"^^ \ 1 

^^(/) = = ^^^^(^^-^^^)**' 

where c is the {n + l)st column of M . 
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